Semi-Automatic Query Expansion using Most Discriminant Words

نویسنده

Alessio Signorini

چکیده

Most casual users of IR systems type short queries. With current indexing technologies, short queries return enormous amount of results that may be impossible to examinate carefully. If users not find what they are looking for between the first 10/100 results, they may stop searching, losing the important results wrongly ranked by the search engine. While users generally know what they are looking for, the task of express their desires in a compact, precise, written form, i.e. the query, represent a real problem. Word usage is in fact both domain and user dependent, and may easily mislead the search engine. In this report I investigate a new method for query expansion, that exploiting the user’s feedback on some discriminant words, try to increase the focus on the user’s

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Creation and Maintenance of Query Expansion Rules

In an information retrieval system, a thesaurus can be used for query expansion, i.e. adding words to queries in order to improve recall. We propose a semi-automatic and interactive approach for the creation and maintenance of domain-specific thesauri for query expansion. Domain-specific thesauri are especially required in highly technical domains where the use of general thesauri for query exp...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Annotation and verification of sense pools in OntoNotes

The paper describes the OntoNotes, a multilingual (English, Chinese and Arabic) corpus with large-scale semantic annotations, including predicate-argument structure, word senses, ontology linking, and coreference. The underlying semantic model of OntoNotes involves word senses that are grouped into so-called sense pools, i.e., sets of near-synonymous senses of words. Such information is useful ...

متن کامل

English-Japanese Cross-lingual Query Expansion Using Random Indexing of Aligned Bilingual Text Data

Vector space models can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. In this paper, we report on our NTCIR 2002 experiments using the Random Indexing vector space method for extracting an English-Japanese cross-lingual thesaurus from aligned English-Japanese bilingual data. The crosslingual thesaurus has been used for automatic...

متن کامل

Automatic query expansion and word sense disambiguation with long and short queries using WordNet under vector model

This paper describes the experimentation conducted to test the effectiveness of automatic query expansion and word sense disambiguation (WSD) using short and long query of a topic TREC under vector model. We ran different experiments generating queries under vector model using linguistic information extracted from WordNet. Results show that query expansion with short queries and long queries is...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Semi-Automatic Query Expansion using Most Discriminant Words

نویسنده

چکیده

منابع مشابه

Creation and Maintenance of Query Expansion Rules

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Annotation and verification of sense pools in OntoNotes

English-Japanese Cross-lingual Query Expansion Using Random Indexing of Aligned Bilingual Text Data

Automatic query expansion and word sense disambiguation with long and short queries using WordNet under vector model

عنوان ژورنال:

اشتراک گذاری